Part A - Domain - Telecom

1. Data understanding and Exploration

2. Data clearning and Analysis

2D.......... Insights on question 2C

There are totally 19 categorical value columns

  1. Gender - There is almost an equi balance of Male and Female customers
  2. Partner - There is again almost equi balance of Customers having partners and no partners
  3. Dependents - only 1/3rd of the Customers almost 30% have dependents while the remaining 70% have no dependents
  4. PhoneService - A huge 90% of the Customers have availed for Phoneservice while a small 10% have not opted for it
  5. Multiplelines - About 42% of the customers have multiple lines while 48% of the customers do nto have multiple lines
  6. Internetservice - A majority of Customers 44% have Fibre optic, 34% have DSL while 21% have no Internet servicce
  7. OnlineSecurity - 50% of customers have not availed Onlinesecurity and only 28% of the customers have taken this service,
                 21% in any case have no Internetservice
  8. OnlineBackup - Only 34.5% of the Customers have availed this service, 44% have not while 21% have no Internetservice at all
  9. DeviceProtection - Numbers almost look similar to the OnlineBackup 34.5 - yes, 44% no, 21% no Internetservice
  10. Techsupport - 29% of the customers have only signed up for this, 49% have'nt, while 21% have no Internetservice
  11. StreamingTV - Very equally distributed... 38% signed up, 39% not signed up, 21% no Internetservice
  12. STreamingMovies - Almost similar distribution as found in StreamingTV
  13. Contract - 55% of the customers going for month to month, 21% one year & 24% for two year contracts
  14. Paperless Billing - Almost 60% have opted for this, while the remaining 40% are for paper billing
  15. PaymentMethod - Credit card choice - 21.6%, Bank transfers - 22%, Electronic transfer - 35%, Mailed checks - 23%
  16. Churn - This is the Target variable - 73.5% look to be Loyal customers while 26.5% Churn out

3. Model building and Improvement

Part B - Domain IT